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Abstract: 


powerful information encapsulation ability and convenient variational optimization method. In particular, there have 


Artificial Neural Network (ANN) has become a powerful tool in the field of scientific research with its 


been many recent advances in computational physics to solve variational problems. Deep neural network (DNN) is 
used to represent the wave function to solve quantum many-body problems using variational optimization. In this work 
we used a new Physics-Informed Neural Network (PINN) to represent the Cumulative Distribution Function (CDF) of 
some classical problems in quantum mechanics and to obtain their ground state wave function and ground state energy 
through the CDF. By benchmarking against the exact solution, the error of the results can be controlled at a very low 
level. This new network architecture and optimization method can provide a new choice for solving quantum many- 


body problems. 
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1 Introduction 


With the rapid growth of data and computing resources, 
Machine Learning(ML) technology is playing an increas- 
ingly important role in more and more areas. For some phys- 
ical phenomena expressed by partial differential equations, 
it is difficult for traditional methods to solve complex sit- 
uations such as high dimensions. Universal approximation 
theorem!!! enables deep neural networks to solve variational 
function problems. The deep learning based method pro- 
posed in this paper is a new solution, by minimizing the dif- 
ference between variational functions and partial differential 
equations, DNN can transform physical problems"! into op- 
timization problems"! that can be solved with modern deep 
learning libraries. 


In the case of variational equations, physical constraints 
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[4-8], In this paper, instead of using DNN 


play an essential role 
to represent the wave function directly, we use DNN to rep- 
resent the cumulative distribution function 三 f(x')dx' of 
the wave function y(x). In order to improve the efficiency 
of data learning, we apply the unitary constraint to the vari- 
ational function, and the whole computational process does 
not need to compute numerical integration, thus greatly re- 
ducing the computational amount. These advantages make 
the method in this paper more suitable for dealing with mul- 
tiple problems with a large amount of computation. 

In this paper, we reviewed that physically constrained 
neural networks can be used as efficient trial wave func- 
tions in solving Schrodinger equations with the harmonic os- 
cillator and the Woods-Saxon. We have also extended this 
method to Schrodinger equation with the double-well poten- 


tials in the same framework. 


2 Methods 


Unlike the previous direct expression of the wave 
function by the Variational Artificial Neural Networks 
(VANN))?!], we obtain the wave function indirectly 
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through the CDF function, and the relationship between the 
two is as follows 
Aly) = Eyly) (3) 
where H = -#Ż vy? + V(x) is the Hamiltonian operator and 


F(x) = | w(x" yr (x! dx! (1) 


v= fo (2) 
x 


where F(x) is the CDF, y(x) is the wave function. Auto dif- 
ferentiation (auto-diff)""! tools in deep learning libraries can 
be used to compute the derivative d F/dx. The wave function 
obtained by the CDF satisfies the normalisation, thus avoid- 
ing numerical integration and simplifying the calculation. 
Furthermore, the value range of the CDF is between (0, 1), 
which makes it much easier to train. Due to the above two 
advantages, our algorithm has less computation and training 


cycles, while maintaining good accuracy. 


The input of the neural network is the n-dimensional 
spatial coordinates x. The first layer of the DNN has m = 32 
hidden neurons whose values are computed by h; = o(xW,+ 
b,), where W, is the weight matrix with n x m elements and 
b, is the bias vector with m values. To avoid linear super- 
position, a non-linear function o, called the activation func- 
tion, is introduced, which can improve the representational 
ability of the DNN. The second layer works similarly, taking 
the output of the first layer as the input of the second layer 
h, = o(h,W, + b,), allowing multiple hidden layers to be 
stacked. The last layer returns the value of the CDF func- 


tion. 


Since the CDF function is monotonic, the variational 
function of the DNN fitting must also be monotonic. To solve 
this problem, we add a non-negative constraint to the weight 
W,. Theoretically, nonlinear functions such as sigmoid, tanh 
and relu can all be used as the activation function of the DNN, 
but in practice, since we need continuous and monotonic ac- 
tivation functions, and the derivative cannot be 0, we finally 
choose the sigmoid function as the activation function, which 
satisfies the above conditions. And the sigmoid function has 
a range of (0, 1) ,;which guarantees the output range of the last 
layer. 


The goal of the training is to minimise the violation 
between the wave function obtained by the DNN and the 
Schrodinger equation, so that the ground state energy and the 
corresponding wave function can be found. 


Eo represents its smallest eigenvalue. The loss function is 


defined as follow, 
L(@) = (H — Elw) + |E Qmin)| + LF max) — 11 (A) 


where 0 is a trainable parameter, including W,, b, and Ep, 
Eo is another trainable parameter initialised to 0. DNN guar- 
antees that y satisfies the Schrodinger equation by minimis- 
ing the first term, and the other two terms guarantee that the 
CDF function fitted by DNN satisfies the boundary condi- 
tions. The range of the CDF provides normalisation of the 
wave function, which simplifies the calculation. 

Note that the initial values of the trainable parameters 
W and b, are usually initialised randomly or in the Xavier 
scheme!!3], In our work, however, the initialisation scheme 
of the parameter has little effect on the result, so we use ran- 
dom initialisation. 

We add another loss term e’“», where y is a small value. 
By subtracting this term from the loss function, integration is 
avoided and DNN optimizes the loss function to make Eo 
lower. By training this energy term, the ground state en- 
ergy and the ground state wave function can be obtained with 
small error!“ . 

We tested the performance of the DNN Schrodinger 
equation solver on three classical quantum mechanical prob- 
lems. The first problem is the harmonic oscillator model!!5!, 
the potential of the harmonic oscillator can be written as, 


yV = Tma? (5) 


where m is the mass of the oscillator, œ is its angular fre- 
quency and x is its deviation from equilibrium position. 


The second example is to solve Schrodinger equation 


with Woods-Saxon potential !!®!, 


V=—_ (6) 


-Ro 
l+e % 


where ao is the thickness of the surface layer and the average 
radius is given as Rp. 

The third is a double-well potential problem, which is a 
new potential added to our previous work""4!. The potential 


in the Hamiltonian can be written as: 


V = —e hxtl-a)(xti+a) A e7 -l+a)(x-l-a) (7) 
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Where h,/,a respectively control the depth, position, and 
width of the double-well. Unlike the first two questions,the 
ground state wave function of a double-well potential prob- 
lem will exhibit two peaks. Applying our method to such a 
potential can further verify the universality of the architec- 


ture. 


For simplicity, some parameters are set as follows, 


h=m=o = 1, R} = 62,4) =0.1,h = 4,a = 0.5,1 = 1.5 
(8) 


The input to DNN is an unordered list of domain coor- 
dinates. The ground state energy and the CDF function are 
obtained by minimising the terms of the loss function, and 
thus the ground state wave function is obtained indirectly. 
Unlike the previous method of solving the Schrodinger equa- 
tion using supervised learning, our method belongs to unsu- 
pervised learning. The wave function is usually very close 
to the exact wave function after training the DNN through 
iterations in harmonic oscillator, Woods-Saxon potential, in- 
finitely high potential well04 and a double-well potential. 
DNN can get very accurate results after 2000 iterations, but 
we trained 10,000 times, using a very small learning rate in 
the last 1,000 times. 


14,17-18] that add momentum 


We used Adam algorithm! 
mechanism and adaptive learning rate to the simple stochas- 


tic gradient descent, 
O41 =9,- i 一 (9) 


where 0 represents the model parameters, including weights 
and biases within a neural network, The term n denotes the 
current iteration number, reflecting the step in the iterative 
process of optimization. ’Ir’, the learning rate, is a critical 
hyperparameter that controls the size of the steps taken to- 
wards the minimum of the loss function. m is the batch size, 
which determines the number of samples used for each pa- 
rameter update. DY represents the gradient of the loss func- 
tion /; with respect to 0, guiding the direction and magnitude 
of parameter adjustments. 

We use a learning rate scheduler to speed up the training, 
setting a higher learning rate in the early stages to speed up 
convergence, and a lower learning rate in the later stages to 


reach the global minimum. 


To quantify the difference between the wave function 
obtained by DNN wpNN and the real wave function Wye, WE 


introduce the partial wave fidelity K, 


2 
_ < Wirel WpNN > 

< WeruelWirue >< WDNN|WDNN > 

The closer the K is to one, the closer the result of DNN 


is to the exact wave function. 


(10) 


3 Results 


For the harmonic oscillator problem, DNN can control 
the ground state energy error within 0.06% after 1500 train- 
ing, and reduce the error to 0.002% after 10000 training, the 
partial wave fidelity K is 0.9999967 when using 4 hidden 
layers with 16 units per layer. To study the influence of the 
number of variational parameters on the training results, K is 
discussed in Ref.[14] by using different numbers of hidden 
layers and units per layer. 

To understand the difference between the wave function 
obtained by DNN and the real wave function, we calculate 
the CDF function, the wave function and the first and second 
derivatives of the wave function respectively. 
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Fig. 1 One-dimensional harmonic oscillator’s CDF func- 


tion04.(The image on the right is a partial enlargement of 
the image on the left) 


The error between the CDF function fitted by DNN and 
the real CDF function is within 0.0001 as shown in Fig. 1. 
The error range of the ground state wave function can be 
controlled within 0.0002 as shown in Fig. 2. In addition, 
the accuracy of the first and second derivatives of the wave 
function obtained by the DNN is also very high, which means 
that the DNN has learned the real physical information of the 
wave function. 

We verified the universality of the method by using the 
same network to solve the Schrodinger equation with the 
Woods-Saxon potential and the double-well potential. For 
the Schrodinger equation with Woods-Saxon potential en- 
ergy, using the same parameter y in the harmonic oscillator 


potential, the Woods-Saxon potential could control the error 
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Fig.3 The ground state wave function in the Woods-Saxon po- 


tential energy041. 


function obtained from the DNN also agrees very well with 
the exact solution. The error range can be controlled within 
0.0002 for the ground state wave function of the Woods- 
Saxon potential as shown in Fig.3. And the ground state en- 
ergy calculated by the DNN for the Woods-Saxon potential 
problem is -0.97382, which is also within an error of 0.002% 
to the exact result of -0.97385 and K = 0.999964. 

However, compared with the results of harmonic oscil- 
lator and Woods-Saxon potential, because the ground state 
energy is too close to the first excitation state energy under 
the double-well potential, we need to appropriately increase 
the energy layer parameters y so that DNN gives priority to 
energy optimization, in which case the ground state energy 
error of the double-well potential is less than 0.5%. In addi- 
tion, we make the DNN more expressive by increasing the 
number of hidden layers, which brings the value of K closer 
to 1. The relative error of the ground state energy can be kept 
within 0.5% and K remains stable around 0.99734 for the 
double-well potential problem. It will still maintain a high 
level of accuracy, although it will not be able to reach the 
accuracy of the first two problems. 


4 Conclusions 


We use DNN to fit the CDF function of the ground state 
wave function, so that the resulting wave function automat- 
ically satisfies the normalisation and simplifies the compu- 
tation. By minimising the violation of the trial wave func- 
tion and the trial ground state energy Ep to the Schrodinger 
equations, the variational problem is reduced to an optimiza- 
tion problem. This method improves the accuracy of the 
Schrodinger equation solution with less computation. The 
effectiveness of this method is confirmed by solving the har- 
monic oscillator, the Woods-Saxon potential and the double- 
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well potential with small relative error. 


E, = WHIv) 
0 wily) í ind 
for both the numerator and the denominator in traditional 


where numerical integration is required 


variational methods for solving quantum mechanical prob- 


[9-10] the form of 


lems. In another DNN Schrodinger solver 
a trial wave function should be approximated to the exact 
solution for eliminating fluctuation and the initial values of 
the network parameters also greatly affect the optimization 
results. In our framework, the goal is to minimise the viola- 
tion of the Schrodinger equation on the sampled spatial co- 
ordinates. The trial wave function is automatically normal- 
ized based on the constraints of the constructed neural net- 
work. The normalized ground wave function improves the 
efficiency of the Eo calculation with a lower relative error. 
Most importantly, our algorithm is more universal and pro- 
vides the possibility to solve problems that have never been 
solved before, because it can directly ignore the pre-training 
process and does not need to know any information about the 
exact solution before training. 

There are a number of improvements that can be made 
to the current method. For example, active learning or coor- 
dinate sampling by the learned wave function can be used to 
improve training efficiency. We could improve the accuracy 
in solving the double-well potential problem by optimising 
the structure of our framework. The DNN framework will 
be optimized by solving the Schrodinger equation with a 3- 
dimensional potential. It may also open up a new way to 


solve many nucleon problems. 
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用 软 约 束 单调 神经 网 络 求解 薛 定 谓 方 程 
李 汉 林 !， 刘 轩 !， 薄 开放 !， 庞 龙 刚 ? 


(1. 理学 院 ， 武 汉 科 技 大 学 , 湖北 省 武汉 市 430065; ; 
2. 华中 师范 大 学 粒子 物理 研究 所 ， 硅 克 与 轻 子 物理 教育 部 重点 实验 室 ， 湖 北 省 武汉 市 430079 ) 


摘要 : 人 工 神经 网 络 (Artificial Neural Network, ANN) 以 其 强大 的 信息 封装 能 力 和 方便 的 变 分 优化 方法 成 为 科 
学 研究 领域 的 有 力 工 具 。 特 别 是 最 近 在 计算 物理 解决 变 分 问题 方面 取得 了 许多 进展 。 用 深度 神经 网 络 (DNN) 
表示 波 函 数 来 求解 变 分 优化 的 量子 多 体 问题 时 ,我 们 使 用 一 种 新 的 物理 信息 神经 网 络 (PINN) 来 表示 量子 力学 
中 一 些 经 典 问题 的 累积 分 布 函 数 (CDF)， 并 通过 CDF 获得 他 们 的 基态 波 函 数 和 基态 能 量 。 通 过 对 精确 解 的 基 
准 测试 ， 可 以 将 结果 的 误差 控制 在 很 小 范围 。 这 种 新 的 网 络 结构 和 优化 方法 为 解决 量子 多 体 问题 提供 了 新 的 
选择 。 
关键 词 : 深度 神经 网 络 ; 变 分 问题 ; 累积 分 布 函数 ; 基态 波 函 数 
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